Streaming Binary Sketching based on Subspace Tracking and Diagonal Uniformization

نویسندگان

  • Anne Morvan
  • Antoine Souloumiac
  • Cédric Gouy-Pailler
  • Jamal Atif
چکیده

In this paper, we address the problem of learning compact similarity-preserving embeddings for massive high-dimensional streams of data in order to perform efficient similarity search. We present a new method for computing binary compressed representations -sketchesof high-dimensional real feature vectors. Given an expected code length c and high-dimensional input data points, our algorithm provides a binary code of c bits aiming at preserving the distance between the points from the original high-dimensional space. Our offline version of the algorithm outperforms the offline state-of-the-art methods regarding their computation time complexity and have a similar quality of the sketches. It also provides convergence guarantees. Moreover, our algorithm can be straightforwardly used in the streaming context by not requiring neither the storage of the whole dataset nor a chunk. We demonstrate the quality of our binary sketches through extensive experiments on real data for the nearest neighbors search task in the offline and online settings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Model Checking Markov Chains Using Krylov Subspace Methods: An Experience Report

The predominant technique for computing the transient distribution of a Continuous Time Markov Chain (CTMC) exploits uniformization, which is known to be stable and efficient for non-stiff to mildly-stiff CTMCs. On stiff CTMCs however, uniformization suffers from severe performance degradation. In this paper, we report on our observations and analysis of an alternative technique using Krylov su...

متن کامل

Binary Coding in Stream

Big data is becoming ever more ubiquitous, ranging over massive video repositories, document corpuses, image sets and Internet routing history. Proximity search and clustering are two algorithmic primitives fundamental to data analysis, but suffer from the “curse of dimensionality” on these gigantic datasets. A popular attack for this problem is to convert object representations into short bina...

متن کامل

Neural Network Performance Analysis for Real Time Hand Gesture Tracking Based on Hu Moment and Hybrid Features

This paper presents a comparison study between the multilayer perceptron (MLP) and radial basis function (RBF) neural networks with supervised learning and back propagation algorithm to track hand gestures. Both networks have two output classes which are hand and face. Skin is detected by a regional based algorithm in the image, and then networks are applied on video sequences frame by frame in...

متن کامل

Transmission subspace tracking for MIMO communications systems - Global Telecommunications Conference, 2001. GLOBECOM '01. IEEE

This paper describes the benefits of transmission subspace tracking for multiple input multiple output communications systems, and applies the concepts of previous work on adaptive transmit antenna algorithms to this problem. A specific stochastic gradient technique of subspace or “multi-mode” tracking of the independent modes of the MIMO transfer function and the application to space time codi...

متن کامل

Transmission subspace tracking for MIMO communications systems

This paper describes the benefits of transmission subspace tracking for multiple input multiple output communications systems, and applies the concepts of previous work on adaptive transmit antenna algorithms to this problem. A specific stochastic gradient technique of subspace or "multi-mode" tracking of the independent modes of the MIMO transfer function and the application to space time codi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1705.07661  شماره 

صفحات  -

تاریخ انتشار 2017